When and why are log-linear models self-normalizing?
نویسندگان
چکیده
Several techniques have recently been proposed for training “self-normalized” discriminative models. These attempt to find parameter settings for which unnormalized model scores approximate the true label probability. However, the theoretical properties of such techniques (and of self-normalization generally) have not been investigated. This paper examines the conditions under which we can expect self-normalization to work. We characterize a general class of distributions that admit self-normalization, and prove generalization bounds for procedures that minimize empirical normalizer variance. Motivated by these results, we describe a novel variant of an established procedure for training self-normalized models. The new procedure avoids computing normalizers for most training examples, and decreases training time by as much as factor of ten while preserving model quality.
منابع مشابه
Normalized Log-Linear Interpolation of Backoff Language Models is Efficient
We prove that log-linearly interpolated backoff language models can be efficiently and exactly collapsed into a single normalized backoff model, contradicting Hsu (2007). While prior work reported that log-linear interpolation yields lower perplexity than linear interpolation, normalizing at query time was impractical. We normalize the model offline in advance, which is efficient due to a recur...
متن کاملMonitoring Multinomial Logit Profiles via Log-Linear Models (Quality Engineering Conference Paper)
In certain statistical process control applications, quality of a process or product can be characterized by a function commonly referred to as profile. Some of the potential applications of profile monitoring are cases where quality characteristic of interest is modelled using binary,multinomial or ordinal variables. In this paper, profiles with multinomial response are studied. For this purpo...
متن کاملOn the Accuracy of Self-Normalized Log-Linear Models
Calculation of the log-normalizer is a major computational obstacle in applications of log-linear models with large output spaces. The problem of fast normalizer computation has therefore attracted significant attention in the theoretical and applied machine learning literature. In this paper, we analyze a recently proposed technique known as “self-normalization”, which introduces a regularizat...
متن کاملInspecting the mechanism: closed-form solutions for asset prices in real business cycle models
In this paper we derive closed-form solutions for a variety of prices for financial assets in an RBC economy. The equations are based on a log-linear solution of the RBC model and allow a clearer understanding of the determination of risk premia in models with production. We demonstrate not only why the premium of equity over the risk-free rate is small but also why the premium of equity over a...
متن کاملEvaluation of prognostic factors affecting long and short term survival rates of Hodgkin's lymphoma patients using the cure fraction models
Background and Aim: This study aimed to analyze the factors affecting time and experience of relapse in the patients with Hodgkin's lymphoma, using cure fraction. Material and Methods: This retrospective study included all the patients diagnosed as Hodgkin's lymphoma in the Center for oncology and hematology in Shafa Hospital in Ahwaz City from 2002 to 2012. We used survival analysis and cure f...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015